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The  title  of  this  volume  suggests  hew  far  we  have  come.  No  one  would 
ever  have  contested  the  importance  of  understanding  the  brain,  the 
interest  of  identifying  how  learning  takes  place  or  how  and  where 
memory  is  stored.  Since  Artistotle  (who  identified  heart  as  the  seat 
of  intellect  reserving  for  brain  the  function  of  cooling  the  body)  the 
brain  has  generally  been  thougit  to  be  the  source  of  thought,  the 
location  of  memory,  the  physical  basis  of  mind,  soul,  consciousness, 
and  self-awareness:  all  that  make  us  distinct  and  human.  What  has 
changed  is  the  belief,  becoming  more  and  more  prevalent,  that  the  time 
has  come.  Not  only  is  the  problem  fascinating  and  important  but,  also, 
the  tools  are  available.  We  can  do  it  now. 

For  many  years  we  have  heard  talk  about  possible  modification  of 
synapses  between  neurons  as  the  physiological  basis  of  learning  and 
memory  storage.  These  relatively  vague  ideas  are  becoming  more 
precise.  Insight  into  the  molecular  basis  of  synaptic  modification  is 
begining  to  appear;  the  role  of  possible  global  controllers  such  as 
Norepinephrine  and  Acetylcholine  is  being  clarified;  and  a  mathematical 
structure  for  the  network  of  neurons  is  rapidly  evolving. 

What  I  would  like  to  present  here  is  a  brief  summary  of  some  recent 
theoretical  and  experimental  results  related  to  plasticity  in  visual 
cortex,  and  presumably  related  to  the  changes  that  take  place  in  the 
nvervous  system  when  learning  occurs  and  when  memory  is  stored.  More 
important  than  the  details,  I  hope  to  convince  you  is  that  what  is 
being  presented  provides  us  with  a  language  in  which  questions 
concerning  memory  and  learning  can  be  discussed  with  clarity  and 


precision 


Networks  that  Remember 


1 . 


That  most  intriguing  aspect  of  human  memory:  its  persistence  in  spite 
of  continual  loss  of  individual  neurons  over  the  lifetime  of  the 
individual  has  led  many  workers  to  the  concept  of  distributed  memory. 
(Longuet- Higgins  1968;  Anderson,  1970,  1972;  Cooper,  1 974;  Pribam  et 
al.,  1971*;  Kohonen,  1977).  For  a  distributed  memory  (more  like  a 
hologram  than  a  photograph)  possesses  in  a  very  natural  way  the 
property  of  relative  invulnerability  to  the  loss  of  storage  units: 
individual  memory  sites  hold  superimposed  information  concerning  many 
events.  In  order  to  obtain  a  single  event,  information  must  be 
gathered  from  many  sites.  Loss  of  individual  units  decreases  signal  to 
noise  ratios  but  does  not  lose  items  of  information. 

Further,  in  contrast  to  modern  computers  that  perform  large  numbers  of 
sequential  operations  very  rapidly  and  very  accurately,  the  central 
nervous  system  works  slowly  and  probably  not  with  enormous  accuracy  on 
the  level  of  individual  units,  with  cycle  times  that  cannot  be  shorter 
than  a  few  milliseconds.  Howevervfye  can  make  complex  decisions  in 
anall  parts  of  a  second.  This  suggests  very  strongly  that  there  is 
much  parallel  processing  in  the  brain  -  an  idea  that  is  almost  obvious 
on  inspection  of  a  component  such  as  the  retina. 


It  is  now  commonly  thougit  that  the  synaptic  junction  may  be  a  means  to 
store  information  (memory,  for  example)  as  well  as  to  transmit  it  fran 
neuron  to  neuron.  Large  networks  of  neurons  connected  to  other  neurons 
via  modifiable  synaptic  junctions  provide  the  physiological  substrate 
for  the  distributed  parallel  systems  discussed  here. 


V-'vAV'Sl '  V-\>\ ••  '•••  \  •' 


2. 


For  a  distributed  memory  it  is  the  simultaneous  or  near  simultaneous 
activities  of  many  different  neurons  (the  result  of  external  or  internal 
stimuli)  that  is  of  interest.  Thus  a  large  spatially  distributed  pattern 
of  neuron  discharges,  each  of  which  might  not  be  very  far  from  spontaneous 
activity,  could  contain  important,  if  hard  to  detect,  information.  Let  us 
consider  the  behavior  of  an  idealized  neural  network  (that  mi^it  be 
regarded  as  a  model  component  of  a  nervous  system)  to  illustrate  some  of 
the  important  features  of  distributed  mappings. 


Consider  N  neurons  1,  2,  ...  ,  ",  each  of  which  has  some  spontaneous  firing 
rate  rj0.  (This  need  not  be  the  same  for  all  of  the  neurons,  nor  need  it 
be  constant  in  time.)  We  can  then  define  an  N-tuple  whose  components  are 
the  difference  between  the  actual  firing  rate  rj  of  the  jth  neuron  and  the 
spontaneous  firing  rate  rjo; 


By  constructing  two  such  banks  of  neurons  connected  to  one  another  (or 
even  by  the  use  of  a  single  bank  which  feeds  signals  back  to  itself), 
we  arrive  at  a  simplified  model  as  illustrated  in  Figure  1.1. 


An  Ideal  Distributed  Mapping. 

Each  of  the  N  input  neurons  in  F 
is  connected  to  each  of  the  N  output 
neurons  in  G  by  a  single  ideal  junction. 
(Only  the  connections  to  i  are  drawn.) 


The  actual  synaptic  connections  between  one  neuron  and  another  are 
generally  complex  and  redundant;  we  have  idealized  the  network  by 
replacing  this  multiplicity  of  synapses  between  axons  and  dendrites  by 
a  single  ideal  junction  which  summarizes  logically  the  effect  of  all  of 
the  synaptic  contacts  between  the  incoming  axon  branches  from  neuron  j 


in  the  F  bank  and  the  dendrites  of  the  outgoing  neuron  i  in  the  G  bank 
(Figure  1.2). 


G  8ANK  F  BANK 


Fig.  2.  An  ideal  synaptic  junction. 


Although  the  firing  rate  of  a  neuron  depends  in  a  complex  and  nonlinear 
fashion  on  the  presynaptic  potentials,  there  is  usually  a  reasonably 
well  defined  linear  region  in  which  some  very  interesting  network 
properties  are  already  evident.  We  therefore  focus  our  attention  on 
the  region  above  threshold  and  below  saturation  for  which  the  firing 
rate  of  neuron  i  in  G,  gj_  is  mapped  free  the  firing  rates  of  all  of  the 
neurons  fj  in  F  by: 


5. 


In  doing  this  we  are  regarding  as  important  average  firing  rates,  and 
time  averages  of  the  instantaneous  signals  in  a  neuron  (or  perhaps  a 
small  population  of  neurons).  We  are  further  using  the  known  integra¬ 
tive  properties  of  neurons. 

We  may  then  regard  [A^j]  (the  synaptic  strengths  of  the  N2  ideal  junctions) 
as  a  matrix  or  a  mapping  which  takes  us  frcm  a  vector  in  the  F  space  to  one 
in  the  G  space.  This  maps  the  neural  activities  f  =  (f-j^  ...fjj)  in  the  F 
space  into  the  neural  activities  g  =  (gj,  ...  gjj)  in  the  G  space  and  can  be 
written  in  the  compact  form 

g  =  Af  (3) 


It  has  been  shewn  that  the  non-local  mapping  A  can  serve  in  a  highly 
precise  fashion  as  a  memory  that  is  content  addressable  and  in  which 
'logic'  is  a  result  of  association  and  an  outcome  of  the  nature  of  the 
memory  itself.  (Cooper,  1974) 

To  illustrate  with  a  simple  example,  that  illuminates  the  content 
addressable  properties  of  the  mapping,  write: 


A  =  1  g u  x  fu. 

u 


(««) 


•  V«  V  V\*A%V-  . 


6. 


Here  gy  and  fw  are  output  and  input  patterns  of  neural  activity.  The 
symbol,  x,  represents  the  'outer*  product  between  the  input  and  output 
vectors.  Although  (4)  is  a  well-known  mathematical  form,  its  meaning  as  a 
mapping  among  neurons  deserves  some  discussion.  The  ijth  element  of  A 
gives  the  strength  of  the  ideal  junction  between  the  incoming  neuron  j  in 
the  F  bank  and  the  outgoing  neuron  i  in  the  G  bank  (Figure  2). 


Since 


u  u 

Aij  =  I  Si  fj.  (5) 

y 


the  ijth  junction  strength  is  composed  of  a  sum  of  the  entire  connectivity 
of  the  system  as  reflected  in  firing  rates  of  the  neurons  connected  to  this 
junction.  Each  association  however,  is  stored  over  the  entire  array  of  N  X 
N  Junctions.  This  is  the  essential  meaning  of  a  distributed  memory:  Each 
event  is  stored  over  a  large  portion  of  the  system,  while  at  any  particular 
local  point  many  events  are  superimposed. 


The  fundamental  problem  posed  by  a  distributed  memory  is  the  address  and 
accuracy  of  recall  of  the  stored  patterns. 


7. 


An  arbitrary  event,  e,  in  the  external  world  mapped  by  the  sensory 
apparatus  into  the  pattern  of  neural  activity,  f,  will  according  to  (3), 
generate  the  response,  g  =  Af,  in  G. 

(The  pattern,  f,  might  also  be  the  result  of  some  other  internal  pattern  of 
neural  activity.)  If  we  equate  recognition  with  the  strength  of  this 
response,  say  the  innter  product  (g,g),  and  if,  for  illustration,  we  define 
separated  events  as  those  that  map  into  orthogonal  vectors:  el  fi  f  q2  f2 
...  ek  fK  where 


(f*  ,  t  )  =  0  a  i  g 
1  ct  =  3 


(6) 


then  the  mapping  A  will  distinguish  between  those  events  it  contains, 
u 

the  f  ,  u  =  1,  2,  ...  K  and  other  events  separated  from  these. 


K  K 

Af2  =  l  gU  x  EU  fa  =  I  gu  (fu  ,  fa) 
u=l  U=1 


ga  if  f  "  is  one  of  the  vectors  f  1 . . .  f ^  (7) 


a 

0  if  f  is  not  one  of  these  vectors 


In  this  special  situation,  the  content  addressable,  parallel  distributed 
memory  is  as  precise  as  a  localized  memory. 

fl  -»  g1 

f2  -  g2  (8) 

fk  -  eK 

The  properties  of  such  a  memory  in  more  general  circumstances,  its 
ability  to  form  associations,  to  map  the  external  world,  to  create  an 
'animal  logic'  have  been  discussed  elsewhere.  (Cooper,  1974) 

Long  and  short-term  memory 

The  N2  junctions,  Aij  contain  the  content  of  the  distributed  memory. 

It  could  be  that  a  particular  junction  strength,  Aijf  is  composed  of 
several  different  components  with  different  lifetimes  thought  of  as 
corresponding  to  different  physiological  or  anatomical  effects  (e.g.  , 
changes  in  numbers  of  presynaptic  vesicles,  changes  in  numbers  of 
postsynaptic  receptors.  Changes  in  Ca++  levels  and/or  availability. 


anatomical  changes  such  as  might  occur  in  growth  or  shrinkage  of 
spines).  We  then  have  the  possibility  that  the  actual  memory  content 
(even  in  the  absence  of  additional  learning)  it  will  vary  with  tine. 
For  a  two- component  system  we  might  have 


Ai<(£) 


( long) 
A1J 


(t)  f  A 


(short) 

ij 


(t) 


(9) 


( t )  (long). 

where  A  ^  .  represents  the  memory  at  some  time  t.  while  «.  .  .  ana 

A  —  hor^ave  long  and  snort  lifetimes.  Thus  in  time  A  ,  '"ill  decay 
1 J  ‘  ! 


leaving  A(t)  =  A--°“\;  Whether  what  is  in  the  short-term  memory 
^  J 


component  is  transferred  to  the  long-term  component  mignt  he  determir, 
by  some  global  3igr.al-  depending  on  the  interest  cf  the  inornation 
contained  in  the  short-term  component. 


Fran  this  point  of  view  the  site  of  long  and  short-term  memory  can  be 
essentially  identical.  At  any  given  time  there  is  a  single  memory. 

The  distinction  between  long  and  short-term  memory  is  contained  in  the 


lifetime  of  the  different  components  of  Ai  •; . 


10. 


Networks  That  Learn 


We  now  ask  hew  a  napping  of  the  type  A  might  be  put  into  the  network.  The 
ijth  element  of  A, 


A.  . 
ij 


=  Z 

MV 


c 

MV 


i  j 


(10) 


is  a  weighted  3um  over  the  j  components  of  all  mapped  signals  f  J  and 

the  i  components  of  the  responses  g  appropriate  for  recollection  or 

M 

association.  Such  a  form  could  be  obtained  by  additions  with  each 
input  f  and  output  g  to  the  element  A ^ j : 


5  A.  .  'v  g.  f  , 
i.l  i  J 


(11) 


This  6  Aj_j  i3  proportional  to  the  product  of  the  differences  between 
the  actual  and  the  spontaneous  firing  rates  in  the  pre-and  postsynaptic 
neurons  i  and  j.  [This  is  one  realization  of  Hebb’s  form  of  synaptic 

modification  (Hebb,  1949).]  The  addition  of  such  changes  to  A  for  all 

1]  v 

associations  g  X  f  results  finally  in  a  mapping  with  the 
properties  discussed  in  the  previous  section. 


Synaptic  modification  dependent  on  inputs  alone,  of  the  type  already 
directly  observed  in  Aplysia  (Kandel,  1976),  is  sufficient  to  construct  a 
simple  memory-one  that  distinguishes  what  has  been  seen  frcm  what  has  not, 
but  does  not  easily  separate  one  input  frcm  another.  To  construct  a 
mapping  of  the  form  above,  however,  requires  synaptic  modification 
dependent  on  information  that  exists  at  different  places  on  the  neuron 
membrane-what  we  call  two-(or  higher-)point  modification. 


In  order  that  this  take  place,  information  must  be  communicated  frcm,  for 
example,  the  axon  hillock  to  the  synaptic  junction  to  be  modified.  This 
implies  the  existence  of  a  means  of  internal  communication  of  information 
within  a  neuron-in  the  above  example,  in  a  direction  opposite  to  the  flew 
cf  electrical  signals  (Cooper,  1974).  The  junction  ij,  for  example,  must 
have  information  of  the  firing  rate  f j  (which  is  locally  available)  as  well 
as  the  firing  rate  g^f  which  is  somewhat  removed  (Figure  3).  One 
possibility  could  be  that  the  integrated  electrical  signals  frcm  the 
dendrites  produce  a  chemical  or  electrical  response  in  the  cell  body  which 
controls  the  spiking  rate  of  the  axon  and  at  the  same  time  communicates  (by 
backward  spiking,  for  example)  to  the  dendrite  ends  the  information  of  the 
integrated  slow  potential. 


rw. 


INFORMATION  FLOW 
SIGNAL  FLOW 


Figure  3. 


Two  Point  Modificati 


13. 


Summary  of  Related  Visual  Cortex  Experimental  Data 

The  discussion  above  leads  to  a  central  issue:  what  is  the  principle  of 
local  organization  that,  acting  in  a  large  network,  can  produce  the 
observed  complex  behavior  of  higher  mental  processes.  There  is  no  need  to 
assume  that  such  a  mechanism  —  believed  to  involve  synaptic  modification 
—  operates  in  exactly  the  same  manner  in  all  portions  of  the  nervous 
system  or  in  all  animals.  However,  one  would  hope  that  certain  fundamental 
similarities  exist  so  that  a  detailed  analysis  of  the  properties  of  this 
mechanisn  in  one  preparation  would  lead  to  some  conclusions  that  are 
generally  applicable.  We  are  interested  in  visual  cortex  because  the  vast 
amount  of  experimental  work  done  in  this  area  of  the  brain  —  particularly 
area  17  of  cat  and  monkey — strongly  indicate  that  one  is  observing  a 
process  of  synaptic  modification  dependent  on  the  information  locally  and 
globally  available  to  the  cortical  cells. 

Experimental  work  of  the  last  generation,  beginning  with  the  path-breaking 
work  of  Hubei  and  Wiesel  (  1959,  1962),  has  shown  that  there  exist  cells  in 
visual  cortex  (areas  17,  18,  and  19)  of  the  adult  cat  that  respond  in  a 
precise  and  highly  tuned  fashion  to  external  patterns — in  particular  bars 
or  edges  of  given  orientation  and  moving  in  a  given  direction.  Much 
further  work  (Blakemore  and  Cooper,  1970;  Blakemore  and  Mitchell,  1973; 
Hirsch  and  Spinelli,  1971;  Pettigrew  and  Freeman,  1973)  has  been  taken  to 


14 


indicate  that  the  number  and  response  characteristics  of  such  cortical 
cells  can  be  modified.  It  has  been  observed  in  particular  (Imbert  and 
Buisseret,  1975;  Blakemore  and  Van  Sluyters,  1975;  Buisseret  and  Imbert, 
1976;  and  Fregnac  and  Imbert,  1977,  1978),  that  the  relative  number  of 
cortical  cells  that  are  highly  specific  in  their  response  to  visual 
patterns  varies  in  a  very  striking  way  with  the  visual  experience  of  the 
animal  during  the  critical  period. 

Most  kittens  first  open  their  eyes  at  the  end  of  the  first  week  after 
birth.  It  is  not  easy  to  assess  whether  or  not  orientation  selective  cells 
exist  at  that  time  in  striate  cortex:  few  cells  are  visually  responsive 
and  the  response's  main  characteristics  are  generally  "sluggishness"  and 
fatigability.  However,  it  is  quite  generally  agreed  that  as  soon  as 
cortical  cells  are  reliably  visually  stimulated  (e.g. ,  at  2  weeks),  some 
are  orientation  selective,  whatever  the  previous  visual  experience  of  the 
animal  (eg.  Hubei  and  Wiesel,  1963;  Blakemore  and  Van  Sluyters,  1975; 
Buisseret  and  Imbert,  1976;  Fregnac  and  Imbert,  1978). 

Orientation  selectivity  develops  and  extends  to  all  visual  cells  in  area  17 
if  the  animal  is  reared,  and  behaves  freely,  in  a  normal  visual  environment 
(NR):  complete  "specification"  and  normal  binocular! ty  (about  80J  of 
responsive  cells)  are  reached  at  about  6  weeks  of  age  (Fregnac  and  Imbert, 
1978).  However,  if  the  animal  is  reared  in  total  darkness  from  birth  to 
the  age  of  6  weeks  (DR),  none  or  few  orientation  selective  cells  are  then 


recorded  (frcm  0  to  15?  depending  on  the  authors  and  the  classification 
criteria);  hcwever,  the  distribution  of  ocular  dominance  seems  unaffected 
(Blakemore  and  Mitchell,  1973;  Imbert  and  Buisseret,  1975;  Blakemore  and 
Van  Sluyters,  1975;  Buisseret  and  Imbert,  1976;  Leventhal  and  Hirsch,  1980; 
Fregnac  and  Imbert,  1978).  In  animals  whose  eyelids  have  been  sutured  at 
birth,  and  which  are  thus  binocularly  deprived  of  pattern  vision  (BD),  a 
somewhat  higher  proportion  (from  12  to  50?)  of  the  visually  excitable  cells 
are  still  orientation  selective  at  6  weeks  (and  even  beyond  24  months  of 
age)  and  the  proportion  of  binocular  cells  is  less  than  normal  (Wiesel  and 
Hubei,  1965;  Blakemore  and  Van  Sluyters,  1975;  Kratz  and  Spear,  1976; 
Leventhal  and  Hirsch,  1977;  Watkins,  et  al.,  1978). 

Of  all  visual  deprivation  paradigms,  putting  one  eye  in  a  competitive 
advantage  over  the  other  has  probably  the  mo3t  striking  consequences.  If 
monocular  lid-suture  (MD)  is  performed  during  a  "critical"  period  (ranging 
frcm  about  3  weeks  to  about  12  weeks),  there  is  a  rapid  loss  of 
binocularity  to  the  profit  of  the  open  eye  (Wiesel  and  Hubei,  1963,  1965). 
At  this  stage,  opening  the  closed  eye  and  closing  the  experienced  one  may 
result  in  a  complete  reversal  of  ocular  dominance  (Blakemore  and  Van 
Sluyters,  1974).  A  disruption  of  binocularity  that  does  not  favor  one  of 
the  eyes  may  be  obtained,  for  example,  by  provoking  an  artifical  strabismus 
(Hubei  and  Wiesel,  1965)  or  by  an  alternating  monocular  occlusion,  which 
gives  both  eyes  an  equal  amount  of  visual  stimulation  (Blakemore,  1976). 

In  what  follows,  we  call  this  uncorrelated  rearing  (UR). 


These  results  seem  to  us  to  provide  direct  evidence  for  the  modifiability 
of  the  response  of  single  cells  in  the  cortex  of  a  higher  mammal  according 
to  its  visual  experience.  Depending  on  whether  or  not  patterned  visual 
information  is  part  of  the  animal's  experience,  the  specificity  of  the 
response  of  cortical  neurons  varies  widely.  Specificity  increases  with 
normal  patterned  experience.  Deprived  of  normal  patterned  information 
(dark-reared  or  lid-sutured  at  birth,  for  example)  specificity  decreases. 
Further,  even  a  short  exposure  to  patterned  information  after  six  weeks  of 
dark-rearing  can  reverse  the  loss  of  specificity  and  produce  an  almost 
normal  distribution  of  cells. 

We  do  not  claim  and  it  is  not  necessary  that  all  neurons  in  visual  cortex 
be  so  modifiable.  Nor  is  it  necessary  that  modifiable  neurons  are 
especially  important  in  producing  the  architecture  of  visual  cortex.  It  i 
our  hope  that  the  general  form  cf  modifiability  we  require  to  construct 
distributed  mappings  manifests  itself  for  at  least  some  cells  of  visual 
cortex  that  are  accessible  to  experiment.  We  thus  make  the  conservative 
assumption  that  biological  mechanisms,  once  established  will  manifest 
themselves  in  more  or  less  similar  forms  in  different  regions.  If  this  is 
the  case,  modifiable  individual  neurons  in  visual  cortex  can  provide 
evidence  for  such  modification  more  generally. 


Here  d  d  ,  m  ,  m  are  inputs  and  synaptic  junctions  from  left  and 
right  eyes.  The  output  of  this  neuron  (in  the  linear  region)  can  be 


written 


c 


.1  _r  .  r 

in  .  d  +  m  .  d 


(12) 


This  means  that  the  neuron  firing  rate  (in  the  linear  region)  is  the 
sum  of  the  inputs  from  the  left  eye  multiplied  by  the  appropriate 
left-eye  synaptic  weights  plus  the  inputs  frco  the  right  eye  multiplied 
by  the  appropriate  right-eye  synaptic  weights.  Thus  the  neuron 
integrates  signals  free  the  left  and  right  eyes. 

According  to  the  theory  presented  by  Bienenstock,  Cooper  and  Munro, 

1932  (BCM)  these  synaptic  weights  modify  as  a  function  of  local  and 
global  variables.  To  illustrate  we  consider  the  synaptic  weight  mj» 

Its  change  in  time,  mj  ,  is  given  below: 

mj  =  F(dj  ...mj  ;dk  ..c;  c  ...;  y,Y-Z). 


Here  variables  such  as  dj  ...  mj  are  designated  local.  These 
represent  information  (such  as  the  incoming  signal,  dj  ,  and  the 
strength  of  the  synaptic  junction,  mj)  available  locally  at  the 
synaptic  junction,  mj.  Variables  such  as  dj{  ...c  are  designated 
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quasi- local .  These  represent  information  (such  as  c,  the  firing  rate 
of  the  cell,  or  d^,  the  incoming  signal  to  another  synaptic  junction) 
that  is  not  locally  available  to  the  junction  mj  but  is  physically 
connected  to  the  junction  by  the  cell  body  itself — thus  necessitating 
some  form  of  internal  communication  between  various  parts  of  the  cell 
and  its  synaptic  junctions.  Variables  such  as  c  (the  time  averaged 
output  of  the  cell)  are  averaged  local  or  quasi-local  variables. 

Global  variables  are  designated  X, Y, Z...  These  latter  represent 
information  (e.g.  presence  or  absence  of  neurotransmitters  such  as 
norepinephrine  or  the  average  activity  of  large  numbers  of  cortical 
cells)  that  is  present  in  a  similar  fashion  for  all  or  a  large  number 
of  cortical  neurons  (distinguished  f ran  local  or  quasi-local  variables 
presumably  carrying  detailed  information  that  varies  frcm  synapse  to 
synapse) . 

In  a  form  relevant  to  this  discussion,  BCM  modification  can  be  written 

mj  =<Kc,c;  X.Y-Z.  ..)dj  (14) 

so  that  the  j synaptic  junction,  mj,  changes  its  value  in  time  as 
a  function  of  quasi-local  and  time-averaged  quasi- local  variables,  c 
and  c,  as  well  as  global  variables  X, Y,Z,  through  the  function,  ,  and 
a  function  of  the  local  variable  dj.  The  crucial  function,  $  ,  is 
shown  in  Figure  (5). 


Fig.  (5) 

The  BCM  Modification  Function 


What  is  of  particular  significance  is  the  change  of  sign  of  >>  at  the 

codification  threshold,  ~  >,],  and  the  non-linear  variation  of  -  f.j  with  the 
average  output  of  the  cell  c.  In  a  simple  situation 


°M  =  (c)2 


The  occurance  of  negative  and  positive  regions  for  $  drives  the  cell 
to  selectivity  in  a  'normal*  environment.  This  is  so  because  the 
response  of  the  cell  is  diminished  to  those  patterns  for  which  the 
output,  c,  is  below  threshold  ( $  negative)  while  the  response  is 
enhanced  to  those  patterns  for  which  the  output,  c,  is  above  threshold 
( positive).  The  non-linear  variation  of  the  threshold  with  the 
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average  output  cf  the  cell,  c,  places  the  threshold  so  that  it 
eventually  separates  one  pattern  from  all  of  the  rest.  Further  it 
provides  the  stability  properties  of  the  system. 

A  detailed  analysis  of  the  consequences  of  this  form  of  modification  is 
given  in  BCM.  The  results  (as  modified  in  the  network  analysis 
outlined  next)  are  in  general  agreement  with  what  we  might  call 
classical  experiments  of  the  last  generation.  Neurons  in  normal 
(patterned  environments)  become  selective  and  binocular.  In  various 
deprived  environments  (e.g.  monocular  or  binocular  deprivation)  the 
theoretical  behavior  fellows  the  experimental  results. 


Extension  to  Networks 

To  better  confront  these  ideas  with  experiment  the  single  neuron  dis¬ 
cussed  above,  must  be  placed  in  a  network  with  anatomical  and  physiolo¬ 
gical  features  of  the  region  of  interest.  For  visual  cortex  this 
suggests  a  network  in  which  inhibitory  and  excitatory  cells  receive 
input  fran  the  lateral  geniculate  nucleus  (LGN)  and  from  each  other.  A 
simplified  fora  of  such  a  network,  a  first-order  representation  of  the 
anatomy  and  physiology  of  layer  IV  of  cat  visual  cortex  (Figure  6)  has 
been  studied  by  Scofield  and  Cooper,  (in  Press). 


In  a  network  generalization  of  Eq.  (12),  we  write 


(16) 


where  L^j  are  the  intra cortical  connections 


A  simplified  neural  network. 

Shewn  are  the  two  cell  types:  inhibitory 
represented  by  circles,  and  excitatory  by 
diamonds.  Geniculate  afferents  enter  at  the 
top  of  the  figure  and  synapse  with  all  cells 
in  the  network  at  the  intersection  of  the 
horizontal  and  vertical  fibers.  Also  shewn  are 
intracortical  fiber  for  each  cell  type.  The 
exact  ratio  of  inhibitory  to  excitatory  cells  is 
not  important. 


Analysis  by  Scofield  and  Cooper  of  the  network  along  lines  similar  to 
that  of  the  single  cell  analysis  described  above  shews  that  under 
proper  conditions  on  the  intracortical  synapses,  the  cells  converge  to 
states  of  maximum  selectivity  with  respect  to  the  environment  formed  by 
the  geniculate  signals.  Their  conclusions  are  therefore  similar  to 
those  of  BCM  with  explicit  further  statements  concerning  the 
independent  effects  of  excitatory  and  inhibitory  neurons  on  selectivity 
and  ocular  dominance.  For  example,  shutting  off  inhibitory  cells 
lessens  selectivity  and  alters  ocular  dominance.  The  inhibitory  cells 
may  be  selective  but  there  is  no  theoretical  necessity  that  they  be  so. 

A  mean  field  approximation  to  the  above  network  (Cooper  and  Scofield, 
to  be  published)  shews  that  if  the  average  effect  of  intracortical 
connections  results  in  inhibition  of  individual  cells,  then  in 
monocular  deprivation,  the  geniculo cortical  synapses  to  the  cell  will 
converge  to  non-zero  states  that  give,  as  the  result  of  stimulation  of 
the  closed  eye,  total  responses  that  are  zero.  However,  the  fact  that 
the  geniculocortical  states  are  non-zero  means  that  the  removal  of 
cortical  inhibition  through  the  chemical  blocking  of  inhibitory 
synapses  would  uncover  responses  from  previously  non- responsive  cells. 
This  result  is  in  accord  with  the  experimental  observation  of  'masked 
synapses'  after  the  removal  of  the  inhibitory  effects  of  GABA  with  the 
blocking  agent  bicuculline  (Duffy  et  al.  1976,  Sillito  et  al.  1980). 
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An  unexpected  consequence  of  this  theory  is  a  connection  between  selec¬ 
tivity  and  ocular  dominance.  The  analysis  given  in  BCM  and  extended  in 
the  mean- field  network  theory  shews  that  in  monocular  deprivation, 
non-preferred  inputs  presented  to  the  open  are  a  necessary  part  of  the 
suppression  of  deprived  eye  responses.  It  follows  that  the  more 
selective  the  cell  is  to  the  open  eye  (increasing  the  probability  of 
non-preferred  inputs)  the  more  closed  eye  will  be  driven  to  zero,  thus 
increasing  the  dominance  of  the  open  eye. 

For  an  experimental  test  of  these  ideas  it  is  important  to  determine 
what  happens  during  the  ocular  dominance  shift  produced  by  monocular 
deprivation.  Consider  the  experimental  situation  in  which  monocular 
experience  follows  a  period  of  dark  rearing.  [Such  experiments  are 
presently  being  performed  by  Saul  and  Daniels  (private  communication)]. 
During  dark  rearing  it  is  known  that  most  area  17  cells  become  less 
responsive  (sluggish)  and  lose  their  selectivity  and  that  some  (perhaps 
20?)  become  visually  non- responsive. 

Our  theoretical  analysis  indicates  that  in  the  course  of  monocular  ex¬ 
perience,  those  cells  that  have  become  visually  non-responsive  during 
the  dark  rearing  will  first  shew  an  increase  in  responsiveness  to  the 
open  eye  followed  by  the  development  of  selectivity.  Those  cells  that 
have  survived  the  period  of  dark  rearing  binocular  and  a-specific  will 
exhibit  a  progression  in  which  selectivity  to  the  open  eye  is  in¬ 
creased  while  maintaining  their  response  (often  non- selective)  to  the 
closed  eye.  This  should  result  in  two  sequences  shown  in  figures  7a 


and  7b. 
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Fig.  (7a) 

Progression  of  development  of  selectivity  and 
ocular  dominance.  Note  that  selectivity  develops 
for  responsive  binocular  aspecific  cells  for  the 
open  eye  before  the  response  to  the  closed  eye  is 
driven  to  zero. 


OPEN 


CLOSED 


INITIAL 


Fig.  (7b) 

Progression  of  development  of  re¬ 
sponsiveness  and  selectivity.  Note 
that  responsiveness  to  the  open  eye 
develops  before,  or  along  with, 
selectivity. 


These  results  are  obtained  without  assuming  that  the  intracortical  in¬ 
hibitory  synapses  are  very  responsive  to  visual  experience.  Learning 
can  occur  entirely  among  the  excitatory  LGN-cortical  synapses.  Another 
point  of  view  is  espoused,  for  example,  by  Rauschecker  and  Singer 
(1981).  They  suggest  that  since  cells  lose  their  orientation 
specificity  when  binocularly  deprived  of  pattern  vision,  and  since  it 
has  been  shown  that  inhibitory  connections  play  a  major  role  in 
determining  orientation  selectivity.  (Sillito,  1Q?5),  the  cortical 
inhibitory  synapses  must  suffer  more  than  excitatory  ones. 

These  conflicting  ideas  led  us  to  perform  experiments  on  changes  in 
inhibitory  activity  due  to  visual  experience  (Bear  et  al. 1Q85),  and 
described  in  the  next  section.  They  indicate  that  one  measure  of 
cortical  inhibition  is  relatively  constant  even  during  dramatic  mani¬ 
pulations  of  the  visual  environment. 
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Experimental  Test  of  Changes  in  Inhibitory  Activity 
Due  to  Visual  Experience 

One  of  the  consequences  of  the  network  theory  discussed  in  the  previous 
section, is  that  experimental  results  that  have  been  obtained  in  visual 
cortex  over  the  last  generation  can  be  explained  primarily  by 
modification  of  lateral  geniculate  (LON)  to  cortex  synapses  with 
minimun  changes  among  intra-cortical  synapses.  Thus  the  possibility  is 
opened  that  most  learning  takes  place  in  the  LON  synapses.  This 
somewhat  surprising  result  has  as  one  consequence  the  possibility  of 
great  simplification  in  the  analysis  of  network  modification. 

An  alternate  hypothesis  that  has  been  considered  for  some  time  is  that 
intracortical  synapses  bear  heavy  responsibility  for  modification  in 
cortical  circuitry  during  learning.  In  particular  it  has  been 
suggested  that  ocular  dominance  shifts  in  monocular  deprivation  are  due 
to  increased  activity  of  GABAergic  neurons,  the  open  eye  supressing  the 
closed.  Sillito  (1975)  documented  in  normal  cats  that  visually 
unresponsive  cells  may  be  "unmasked"  by  icntophoretic  bicuculline. 

Thus,  it  is  not  unreasonable  to  speculate  that  many  of  the  unresponsive 
cells  in  visually  deprived  kittens  are  being  suppressed.  Together, 
these  data  suggest  as  a  possible  hypothesis  that  in  kitten  striate 
cortex  the  GABAergic  neurons  respond  to  sensory  deprivation  by  forming 
new  synapses.  This  hypothesis  implies  that  the  density  or  strength  of 
GABAergic  synapses  will  increase  in  zones  of  cortex  that  are  deprived 
of  a  normal  thalamic  input;  in  the  case  of  monocular  deprivation,  these 
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zones  correspond  to  the  closed-eye  ocular  dominance  columns  and  to  the 
monocular  segment  contralateral  to  the  deprived  eye.  On  the  other 
hand,  the  theory  described  above  suggests  that  there  will  be  minimal 
response  of  GABAergic  neurons  to  sensory  deprivation.  This  hypothesis 
has  been  put  to  test  in  a  recent  series  of  experiments  (Bear  et  al. 
1985). 

To  examine  the  distribution  of  GABAergic  synapses,  Bear  et  al 
immunocy  to  chemically  localized  GAD  in  sections  of  striate  cortex. 

While  immuno cytochemistry  is  not  a  Quantitative  measure,  they  reasoned 
that  changes  restricted  to  deprived  ocular  dominance  zones  should  be 
readily  detected  with  this  method.  As  a  quantitative  estimate  of 
GABAergic  synapse  density,  they  biochemically  measured  GAD  activity  in 
homogenates  of  striate  cortex. 

They  found  no  evidence  for  a  change  in  the  distribution  of  GAD  positive 
puncta  in  12  unilaterally  enucleated  kittens.  The  band  of  layer  I V 
puncta  remained  uniform  even  though  the  periods  of  monocular 
deprivation  examined  would  all  be  sufficient  to  cause  a  physiological 
ocular  dominance  shift  in  striate  cortex  GAD  immunoreactivity  was 
unchanged  even  under  conditions  that  produced  alterations  in  the  level 
of  the  metabolic  enzyme,  cytochrome  oxidase.  Measurements  of  GAD 
activity  showed  no  consistent  or  significant  difference  between  either 
the  binocular  segments  of  enucleated  and  control  kittens,  or  the 
monocular  segments  of  enucleated  animals. 
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This  conclusion  is  in  striking  agreement  with  network  analysis  which, 
as  mentioned  above,  suggests  that  inhibitory  synapses  are  much  less 
modified  by  experience  than  excitatory  synapses.  In  addition  to  its 
implications  for  the  'site  of  learning*  such  a  hypothesis  leads  to 
important  simplifications  in  the  analysis  of  complex  networks. 


Possible  Candidates  for  Global  Controllers 


Any  theory  of  learning  generally  requires  global  as  well  as  local 
controls.  As  discussed  previously,  local  controls  are  those  that 
determine  detailed  changes  of  individual  synaptic  junctions  while 
global  information  would  be  expected  to  influence  all  or  large  numbers 
of  synaptic  junctions  in  the  same  way.  From  the  point  of  view  of  a 
learning  theory,  there  must  be  some  way  to  distinguish  more  interesting 
frcm  less  interesting  input.  Experimentally  it  is  known  that  certain 
areas  of  cortex  (visual,  auditory,  somatic  sensory)  exhibit  plasticity 
during  early  critical  periods,  but  not  during  adulthood. 

For  example,  most  neurons  in  the  visual  cortex  of  newborn  kittens  (and 
normal  adult  cats  as  well)  are  activated  equally  well  by  both  eyes. 
During  the  critical  period  (three  weeks  to  three  months  in  cats), 
monocular  lid  suture  or  misalignment  of  the  eyes  (called  strabismus) 
leads  to  the  domination  of  cortical  cells  by  one  eye.  In  contrast, 
adult  cats,  that  are  given  monocular  visual  experience  when  they  are 
older  than  six  months  of  age,  remain  unaffected  by  the  imbalance  of 
visual  inputs;  adult  cortical  neurons  remain  binocularly  activated. 


A  major  question  can  be  formulated  as  follows:  What  are  the  global 


factors  acting  singly  or  in  combination  that  affect  the  development  of 
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In  recent  years  it  has  been  suggested  that  catecholamines  (CA)  are 
required  for  neuronal  plasticity  in  the  neocortex.  One  test  of  this 
hypothesis  has  been  made  in  a  series  of  experiments  performed  by 
Kasamatsu  and  Pettigrew  (1976,  1979)  and  Kasamatsu  et  al.  (1979,  1981) 
who  used  the  monocular  deprivation  paradign  with  kitten  visual  cortex 
as  a  test  system.  In  their  control  kittens  they  found  the  usual  effect 
of  monocular  deprivation  during  the  critical  period— -within  a  week  the 
majority  of  cells  lost  their  normal  binocular  responsiveness  and  could 
be  driven  only  by  stimulation  of  the  non-deprivea  eye.  But,  in  animals 
given  the  neurotoxin  6-hydroxydopamine  (6-OHDA)  to  deplete  cortical 
CAs,  the  ocular  dominance  shift  failed  to  occur  and  cells  remained 
binocularly  driven. 

In  later  experiments  Kasamatsu  and  Pettigrew  pioneered  the  use  of 
miniature  osmotic  pumps  to  infuse  6-OHDA  continuously  to  local  regions 
of  cortex  in  one  hemisphere  while  they  used  the  other  hemisphere  as  a 
control  by  perfusing  only  the  vehicle  for  the  6-CHDA.  Following 
monocular  deprivation,  normal  plasticity  was  again  disrupted  in  CA 
depleted  cortex  as  indicated  by  the  lack  of  a  shift  in  ocular  dominance 
of  visual  cells.  Because  both  noradrenergic  and  dopaminergic  fibers 
project  to  the  visual  cortex,  Kasamatsu  et  al.  also  used  the  minipumps 
to  add  norepinephrine  (NE)  to  the  cortex  previously  depleted  of  CAs  to 
demonstrate  that  NE,  itself  can  restore  plasticity.  This  evidence 
suggests  that  catecholamines,  especially  NE,  are  necessary  for 
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the  cortical  changes  observed  in  kittens  which  have  restricted  vision 
during  the  critical  period. 

However,  the  relationship  between  static  NE  levels  and  plasticity  is 
not  simple.  In  one  experiment,  ^Bear  and  Daniels.  1983;  Rear  et  al. 
1Q8H  cortical  catecholamines  were  permanently  depleted  in  newborn 
kittens  by  intra  peritoneal  injections  of  6-OHDA.  Biochemical  analysis 
demonstrated  severe  reduction  of  NE  levels;  but  in  this  experiment 
plasticity,  as  evidenced  by  ocular  dominance  shift,  remained  intact. 

In  the  neonatal  experiments  kittens  received  6-OHDA  a_t  birth  and  were 
monocularly  deprived  fcr  7  days  at  about  5  weeks  of  age.  The 
mini  pump-kittens  received  6-OHDA  continuously  for  the  7  days  of 
mcnocular  deprivation.  Comparison  of  the  two  paradigns  suggests  the 
possibility  that  loss  of  plasticity  is  not  caused  by  depletion  of  NR 
alone. 

Thus,  despite  Kasamatsu's  success  in  demonstrating  support  for  the  NE 
hypothesis,  questions  have  arisen.  The  above  experiments  shew  that 
early  depletion  of  cortical  NE  does  not.  by  itself  prevent  the  later 
ocular  dominance  shift  after  mcnocular  deprivation.  Other  experiments 
confirm  this  conclusion.  Daw  et  al .  ( •» 984 )  depleted  cortical  NE  by 
section  of  the  locus  cceruleus  fiber  bundle  near  lateral  hypothalmus 
and  found  no  diminution  of  the  ocular  dominance  shift  after  monocular 
deprivation.  Videen  et  al.(1984)  recorded  no  difference  in  the 
reaction  of  kitten  and  adult  cat  visual  cortex  neurons  to 
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iontophoretically  applied  NE.  Adrien  et  al.(1982)  observed  no  lack  of 
shift  after  lesion  of  the  locus  eoeruleus  itself;  and  that  group  was 
unable  to  reproduce  Kasamatsu  and  Pettigrew's,  1979-  finding  that 
intra-ventricular  injection  of  6-OHDA  prevents  ocular  dominance  shift. 
All  of  these  results  reinforce  the  idea  that  NE  is  not  the  only  factor 
in  the  global  control  of  learning. 

In  addition  to  the  norepinephrine  (NE)  system,  several  lines  of 
evidence  suggest  that  the  cholinergic  (AQi)  system  may  also  serve  as  a 
global  modulator  of  cortical  function.  Similar  to  the  locus 
coeruleus-NE  system,  the  basal  forebrain  cholinergic  (ACh)  system  has  a 
widespread  input  to  cortex  that  stands  in  marked  contrast  to  the  highly 
organized  thalamocortical  systems  that  provide  specific  sensory  input 
to  cortex.  In  addition  several  findings  link  both  the  NE  and  ACh 
systems  to  acquisition  and  storage  processes  related  to  learning  and 
memory. 


While  these  global  cortical  inputs  have  been  related  to  memory  and 
learning,  the  specific  cellular  mechanisms  of  ACh  and  NE  function  are 
unclear.  Present  evidence  indicates  that  both  systems  may  modulate  the 
response  of  cortical  neurons  to  specific  sensory  inputs.  NE  appears  to 
improve  the  signal  to  noise  ratio  of  sensory  responses  both  in  somatic 
sensory  and  in  the  primary  visual  cortex  and  NE  may  potentiate  the 
action  of  both  excitatory  and  inhibitory  transmitters.  These  effects 
of  NE  are  mediated  through  alpha  adrenergic  receptors  which  appear  to 
be  concentrated  in  the  deeper  layers  (I^-VT)  of  cortex. 
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The  cholinergic  system  may  function  in  a  manner  similar  to  NE. 
Application  of  lew  levels  of  ACh  enhances  the  excitatory  response  of 
cortical  neurons  to  glutamate  and  modifies  the  task  related  discharge 
of  cortical  neurons  during  behavior.  ACh  has  also  been  shown  to  modify 
the  membrane  incut  resistance  of  cortical  neurons  for  at  least  1.5 
hours.  The  slew  onset,  long  duration  of  action,  and  sensitizing 
effects  of  ACh  are  all  consistent  with  the  conclusion  that  the 
cholinergic  system  is  mainlv  a  modulator  of  cortical  activity.  Thus, 
the  evidence  to  date  suggest  that  both  HE  and  ACh  may  play  a  modulatory 
role  in  cortex. 

This  view  has  been  reinforced  by  recent  work  of  Bear  and  Singer 
(Private  communication)  which  indicates  that  6-GHDA.  in  addition  to 
destroying  neurons,  also  competitively  binds  with  muscarinic 
cholinergic  receptors  to  prevent  ACh  effects  on  cortical  neurons.  In 
addition  Bear  and  Singer  have  shewn  that  the  simultaneous  diminution  of 
both  ACh  and  NE  appears  to  prevent  the  ocular  dominance  shift  in 
monocular  deprivation  while  the  diminution  of  either  ACh  or  NE  (and  not 
both)  does  not  prevent  this  shift. 

These  results  may  enable  a  resolution  of  the  apparent  contradictions  in 
previous  NE  experiments  discussed  above  and,  in  addition,  suggest  the 
fascinating  possibility  that  ACh  and  NE  act  together  to  provide  a 
global  modulator  for  plasticity. 


v  v. 
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